Classifying Blog Posts with Tag Propagation

نویسندگان

  • Bingquan Liu
  • Baoxun Wang
  • Zhen Xu
  • Xiaolong Wang
  • Peng Li
چکیده

Blog tags are usually considered to be supplementary information for blog post classification tasks. Due to the sparsity of tag features, improving performance of classifiers merely using tags is not a trivial operation. This paper presents a blog post classification approach based on the tag propagation strategy. Using a dataset of blog posts gleaned from the Internet, tags of a blog post are propagated from tags of its K nearest neighbors in the blog post dataset. In this case, the original binary feature vectors are changed to real-value ones and the sparsity is reduced. Experimental results show that the classification method based on the tag propagation strategy obtains good performance.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

TagAssist: Automatic Tag Suggestion for Blog Posts

In this paper, we describe a system called TagAssist that provides tag suggestions for new blog posts by utilizing existing tagged posts. The system is able to increase the quality of suggested tags by performing lossless compression over existing tag data. In addition, the system employs a set of metrics to evaluate the quality of a potential tag suggestion. Coupled with the ability for users ...

متن کامل

User Evaluation of a System for Classifying and Displaying Political Viewpoints of Weblogs

This paper presents a Web-based user evaluation of a system for classifying and presenting political viewpoints of blog posts. The system is based on a classification model trained using a supervised learning algorithm, and the data set consists of recent posts from blogs that are self-identified as a liberal or a conservative viewpoint. We first discuss the classification process. Then, with a...

متن کامل

Blog Annotation: From Corpus Analysis to Automatic Tag Suggestion

Nowadays, blogs cover a large audience and they become part of mainstream media. Tags and categories are structural elements of a blog post intended to increase a blog’s visibility and enhance navigation and searching. We suppose that those annotations are made on subjective grounds rather than in a systematic way. This paper presents a 11 million words corpus of blogs posts in French dedicated...

متن کامل

Comment Extraction from Blog Posts and Its Applications to Opinion Mining

Blog posts containing many personal experiences or perspectives toward specific subjects are useful. Blogs allow readers to interact with bloggers by placing comments on specific blog posts. The comments carry viewpoints of readers toward the targets described in the post, or supportive/non-supportive attitude toward the post. Comment extraction is challenging due to that there does not exist a...

متن کامل

Believe Me - We Can Do This! Annotating Persuasive Acts in Blog Text

This paper describes the development of a corpus of blog posts that are annotated for the presence of attempts to persuade and corresponding tactics employed in persuasive messages. We investigate the feasibility of classifying blog posts as persuasive or non-persuasive on the basis of lexical features in the text and the tactics (as provided by human annotators). Annotated tactics provide subs...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Int. J. of Asian Lang. Proc.

دوره 23  شماره 

صفحات  -

تاریخ انتشار 2015